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Amendments to Claims 

Claims 1-34 Cancelled 

35. (Currently amended) A method of discovering one or more patterns in a set 
of k sequences of symbols, called a k-tuple, where k is greater than or equal to two, 
within an overall set of w sequences having sequence numbers 1 , 2, . . ., w , the symbols 
being members of an alphabet, each sequence of symbols having respective lengths Lj, 
Ij2, . . Lw» comprising the steps of: 

a) translating the sequen ces of symbols into a t abl o o f ordered (symbol, position 
index) pairs, where the position index refers to the location of th e symbol in a sequence; 

b) for each of the w sequences, grouping the (symbol, position index) pairs by 
symbol to form a respective master offset table, thus creating w master offset tables; 

c) using the position indices in the w master offset tables to determine the 
difference^in-position value between each occurrence of a symbol in on e of the sequences 
and each occurrence of that_same_symbol in the other sequence in each master offset table 
> forming a k-tuple table associated with the k-tuple ? the table comprising k columns, one 
of the k columns being a primary column and the remaining (k-1) columns being suffix 
columns, each column corresponding to one of the k sequences; 

i) the primary column comprising the (symbol, position index) 
pairs of a primary sequence, 

ii) the (k-1) suffix columns comprising (symbol, difference-in- 
position value) pairs, where the difference-in-position values are the * 
position, differences between all like same symbols of each remaining 
sequence of the tuple and the primary sequence of the tuple, 

iii) the rows in the k-tuple table resulting from forming all 
combinations of Itiee same symbols from each sequence; 

d) creating a sorted k-tuple table by performing a muJti-key sort on the k-tuple 
table, the sort keys being selected respectively from the difiference-in-position values of 
the last suffix column (k* column) through the difference-in-position value of the first 
suffix column; and 

e) defining a set of one or more patterns by collecting adjacent rows of the sorted 
k-tuple table whose suffix columns contain identical difFerenee-in-position values, the 
relative positions of the symbols in each pattern being determined by the primary column 
position indices, the sefrof one or more patterns being common to the k sequences. 

36. (Original) The method of claim 35 further comprising: 

f) deleting all patterns not satisfying a predetermined criteria 

37* (Original.) The method of claim 35 further comprising: 
f) deleting all patterns shorter than a first predetermined span and longer than a 
second predetermined span. 

38. (Original) The method of claim 35 further comprising: 

f) deleting all patterns having fewer than a predetermined number of symbols. 
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39. (Original) The method of claim 35, further comprising the step of deleting 
rows from the k-tuple table which do not have suffix indices identical to any other row of 
the k-tuple table. 

40. (Original) The method of claim 35 further comprising the step of deleting 
rows from the k-tuple table according to predetermined criteria. 

41 . (Previously amended) The method of claim 40, wherein rows sharing 
identical suffix column difference-in-position values are deleted from the k-tuple table if 
there axe fewer than N* such rows, where N s is the minimum number of symbols per 
pattern. 

42. (Currently amended) A method of discovering one or more patterns in a set 
of k+1 sequences of symbols, called a (k+1 Vtuole. where k is greater than or equal to 
two, within an overall set of w sequences having sequence numbers L 2^ .... w , the 
symbols being members of an alphabet, each sequence of symbols having respective 

lengths Li, L? Lr by first forming a k-tuple table and then forming a (k-t-l)-tuple table 

by combiningjhe y - whoroin a k-tuple table iq - o e mbiaod with e an additional sequence 
of symbols, the formation of the_k-tuple_table comprising the steps of: 

a) tran slating the sequences of symbols into ordered (symbol, position index") 
pairs, where the position index refers to the location of the symbol in a sequence: 

b) for each of the w sequences, grouping the (svmboL position index) pairs bv 
symbol to form a respectiye_master_offeet table^thus_creating w mastey_oifeet_tables; 

c) using the positionJndices_in_the w masterjpff^ 
difference-in-position value between each occurrence of a symbol in one of the sequences 
and each occurrence of that same symbol in the other sequence in each master offset table 
. forming a k-tuole table associated with the k-tuple. the table comprising k columns, one 
of th e k columns being a primary column and th e remaining fle-n columns being suffix 
columns* each column corresponding to one of the k sequences: 

ft the primary column comprising the (symbol, position index) 
pairs of a primary sequence. 

ii) the flc-1) suffix columns comprising (symbol, difference-in- 
position value) pairs, whereJhe_differ^cej^^ values are the 
position differences between all like same symbols of each remaining 
sequence of the tuple and the primary sequence of the tuple. 

iii) the rows in the k-tuple table resulting from forming all 
combinations of Kke same symbols from each sequence: 

d) creating a sorted k-txmle_table by performing a multi-key sort on the k-tuple 
table^the sort keys being selected respectively from the difference-in-position values of 
the last su ffix colum n (k th column) through the difference-in-position value of the first 
suffix column: 

thejbrmation of the (k+1.)-tuple table comprising the steps of: 
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«) §X translating the additional sequence of symbols into a tabl e of ordered (symbol, 
position index) pairs, where the position index refers to the location of the symbol in the 
additional sequence of symbols; 

b) & grouping the (symbol, position index) pairs by symbol to form a master 
offset table; 

e) g) creating the (k-H)-tuple table of k+1 columns, one of the k+1 columns being 
a primary column and the remaining k columns being suffix columns, by; 

i) forming all combinations of lake same symbols between the 
primary column of the k-tuple table and the roaster offset table, 

ii) for each such combination, duplicating the corresponding row of 
the k-tuple table, and appending a (symbol, difference-in-position value) pair 
corresponding to the difference between the position index of the master 
offset table and the position index of the primary column • - 

h) creating a sorted (k+1 Vtuple table by performing a multi-key sort on the flc+lV 
tuole table, the sort kevs being selected respectively from the difference-in-position 
values of the last suffix column ITk+lV* 1 column] through the difference-in-position value 
of the first suffix column; and 

i) defining one or more patterns bv collecting adjacent rows of the sorted (k-f-1 V 
ttrole table whose suffix columns contain identical difference-jn-position values, the 
relative positions of the symbols in each pattern being determined bv the primary column 
position indices, the one or more patterns being common to the Ic+1 sequences . 

43. (Currently amended) The method of claim 42 further comprising the step of: 
ii deleting patterns from a. k-tuple table common to the k-tuple table and a (k+1)- 

tuple table, where the (k+l)-tuple table contains all of the sequences of the k-tuple table 
with one additional sequence, by: 

i) e) deleting the suffix column corresponding to a sequence not shared 
between the two tuple tables, thereby defining a modified table, and 

ii) b) deleting aU rows from the k-tuple table whose suffix columns all 
contain identical sets of difference-in-position values to a row of the modified 
table. 

44. (Currently am ended) A method of discovering one or more patterns in a set 
of k sequences of symbols, called a k-tuple, comprising the steps of: 

a) for a first pair of sequences of the k-tuple 

i) translating each sequence of symbols into e - toble - p f ordered 
(symbol, position index) pairs, where the position index of each (symbol, 
position index) pair refers to the location of the symbol in the sequence; 

ii) for each of the paired sequences, grouping the (symbol, position 
index) pairs by symbol to respectively form a first master offset table and a 
second master offset table; 

iii) forming a Pattern Map comprising an array having (LI + L2 - 
1) rows by: 

A) subtracting the position index of the first master 
offset table from the position index of the second master 
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offset table for every combination of (symbol, position 
index) pair having )ike same symbols, the difference-in- 
position value resulting from each subtraction defining a 
row index; 

B) storing each (symbol, position index) pair from 
the first master offset table in a row of the Pattern Map, the 
row being defined by the row index, until all (symbol, 
position index) pairs have been stored in the Pattern Map; 

iv) defining a parent pattern by collecting symbols having the 
identical difference-in-position value from each row of the Pattern Man 
and populating an output array with the collected symbols pf-oaek 
(symbol, positi on index) pair of a row of th e Pattern Map , the symbols 
being placed at relative locations in the parent pattern indicated by the 
position index of the (symbol, position index) pair, and 

v) repeating step iv) for each row of the Pattern Map; 

b) storing the, discovered patterns as arrays of (symbol, position index) pairs; 

c) for each subsequent pair of sequences of the k-tuple, replacing the (symbol, 
position index) pairs of the first sequence of the pair of sequences by the (symbol, 
position index) pairs of the stored patterns; and 

d) repeating steps (a) through (c) for each subsequent pair of sequences until the 
k-th sequence of the k-tuple is reached. 

45. (Currently amended) The method of claim 35, further comprising the step of 
finding all patterns at all levels of support within a set of sequences by: 

f) forming a tree of nodes, where each node corresponds to each combination of k 
sequences, and therefore represents a k-tuple, and wherein each node representing a k- 
tuple is connected to all nodes representing (k+l)-tuples, 

each (k-H)-tuple being, formed by adding a unique sequence to the k-tuple, where 
the sequence being added is later in the ordered list of sequences than the latest sequence 
of the k-tuple; 

g) traversing the tree, and at each node visited during traversal, defining a set of 
one or more patterns by collecting adjacent rows of the sorted k-tuple table whose suffix 
columns contain identical sets of difference-in-position values, the relative positions of 
the symbols in each pattern being determined by the primary column position indices, the 
flet-ef one or more patterns being common to the k sequences, 

46. (Original) The method of claim 45, wherein the traversal of the tree of nodes 
is accomplished via recursion. 

47. (Previously amended) The method of claim 45, further comprising the step 

of: 

h) removing duplicate patterns at each level of support. 

48. (Previously amended) The method of claim 47, wherein the removal of 
duplicate patterns at each level of support step h) is accomplished by: 
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i) for each node corresponding to a (k+l)-tuple, identifying the nodes 
containing k-tuples whose sequences are subsets of the (k+l)-tuple; thereby 
defining a set of causally-dependent nodes; 

ii) locating said causally-dependent nodes; 

iii) removing from each said causally-dependent node the patterns in 
common with the (k+l)-tuple; and 

iv) if the k-tuple table in a causally-dependent node is thereby reduced to 
zero length, removing the corresponding node and all of its descendants from the 
tree prior to their traversal. 

49. (Previously amended) The method of claim 48, therein locating causally- 
dependent nodes in step it) comprises the steps of: 

(A) organizing the nodes at level k in the Tuple-tree into a linked 
list which is ordered from left to right in accordance with the sequence 
numbers represented by each tuple; and 

(B) searching said linked list for nodes which are causally- 
dependent on a particular (k+l)-tuple. 

50. (Original) The method of claim 48, wherein the nodes located in step ii) are 
causally-dependent nodes at level k determined with respect to another node at level k, 
and are thus causally-dependent on a child of the another node at level k. 

51. (Previously amended) The method of claim 47, wherein the removal of 
duplicate patterns at each level of support step h) comprises the steps of: 

i) organizing the nodes at level k in the Tuple-tree into a linked list which 
is ordered from left to right in accordance with the sequence numbers of each 
tuple; 

ii) for each pattern, in the current node at level k, storing a <tf hit list" array 
of the sequence numbers of the sequences containing the pattern ; 

iii) for all nodes to the right of the current node whose sequence numbers are 
all in the bit list, searching for a duplicate instance of the pattern, and if found, 
eliminating it; and 

iv) making each node the current node, repeating steps (ii) and (iii), in the 
order of the node's appearance in the linked list. 

52. (Previously amended) The method of claim 51, wherein, in step iii), the 
nodes consistent with the hit list are found using a hash tree, the hash tree having a root 
and fc levels of nodes, the k-tb level of th e hash tree having a plurality of leaf nodes, the 
respective level of nodes of the hash tree corresponding to the respecti ve sequence 
numbers of a k-tuple, the leaf nodes identifying the k-tuple whose sequence numbers 
correspond to the path from the root to the leaf node, wherein 

searching the nodes for pattern duplicates is performed by repeating steps ii) and 
iii) for each node tn the order of th e appearance of that node in the hash tree* 
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53. (Currently amended) The method of claim 45 wherein the traversing step g} 
o) itself comprises the steps of: 

i) creating a Virtual Sequence Array of patterns found within the sequences, 
wherein the patterns are termed P-nodes and the tuple nodes are termed T-nodes, 

(ii) finding a P-node list corresponding to the location of each pattern in the 
primary sequence of that tree node, 

iii) searching the P-node list for a duplicate instance of the pattern, 

(A) if no duplicate is found: 

(1) adding a pointer to the pattern of the current T-node 
pattern array, 

(2) finding all locations of the pattern within the Virtual 
Sequence Array, 

(3) adding a pointer to the pattern to each corresponding P- 
node array; 

(B) if a duplicate pattern is found: 

(1) ignoring the pattern if the duplicate pattern was found at 
support equal to the current level of support, 

(2) if the duplicate pattern was found at a previous level of 
support, unlinking the duplicate pattern from its previous T-node (if 
it exists), and relinking the duplicate pattern to the current T-node, 

(3) repeating steps 1) and 2) until all of the children of a T- 
node have been created, thus insuring that patterns of that T-node 
that are at their ultimate level of support are reported* and 

(4) deleting the T-node. 

Claims 54-65 Cancelled 

66. (Currently amended) A computer-readable medium containing a plurality of 
data structures useful, in controlling a computer system to discover a set of one or more 
patterns in k sequences of symbols within an overall set of w sequences, the plurality of 
data stxuct^es_cornt>rising: 

a number w of master offset table data structures each grouping, 

for each value of a difference in position between each occurrence of a 
symbol in one of the sequences and each occurrence of that same symbol in 
each other sequence, 

the position (position index) in the first sequence of each symbol 
therein that appears in each of the other sequences at that difference-in- 
position value; 

a first - additionai k-tupte table data structure comprising columns and rows, the 
columns comprising (symbol, position index) pairs and (symbol, dirJerence-in-position 
value) pairs; and 

a se cond additional sorted k-tur)le table data structure comprising a row-sorted 
representation of the (symbol, position index) pairs and (symbol, difiference-in-position 
value) pairs contained in the first - additional k-tuple table data structure A 
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wh erem adj acertt rows of the sorted k-tuole table data structure whose suffix 
columns contain identical difference-in-position values 

define one or more patterns of symbols, the relative positions of symbols m each pattern 
being determined bv the primary column position indices in the sorted k-tuple table data 
structure . 

67. (Currently amended) The computer-readable medium of claim 66 wherein 
the se cond additional somdJk^tuple table data structure further groups, for each value of a 
difference in position, the number of symbols in. the first sequence that appear m the 
second sequence at that'difference-in-position value. 

68. (Currently Amended) A computer-readable medium containing instructions 
for controlling a computer system to discover one or more patterns in a set of k sequences 
of symbols, called a k-tuple, where k is greater than or equal to two, within an overall set 
of w sequences having sequence numbers 1 , 2, . . w , the symbols being members of an 
alphabet, each sequence of symbols having respective lengths Li, L2 7 . . Lw, by executing 
a method comprising the steps of: 

a) translating the sequences of symbols into a - tabloH>f ordered (symbol, position 
in dex) pairs, where the position index refers to the location of the symbol in a sequence; 

b) for each of the w sequences, grouping the (symbol, position index) pairs by 
symbol to form a respective master offset table, thus creating w master offset tables; 

c) using the position indices in the w master offset tables to determine the 
difference-in-position value be1ween_each_occuJxence 

and eackoccurrence of that same symbol in the other sequence in each master offset table f 
forming a k-tupie table associated with the k-tuple, the table comprising k columns, one of 
the k columns being a primary column and the remaining (k-1) columns being suffix 
^ columns, each column corresponding to one of the k sequences; 

i) the primary column comprising the (symbol, position index) 
pairs of a primary sequence, 

ii) the (k-1) suffix columns comprising (symbol, difference-in- 
position value) pairs, where the difference-in-position values are the 
position differences between all Oke same symbols of each remaining 
sequence of the tuple and the primary sequence of the tuple, 

iii) the rows in the k-tuple table resulting from forming all 
combinations of Kke same symbols from each sequence; 

d) creating a sorted k-tuple table by performing a multi-key sort on the k-tuple 
table, the sort keys being selected respectively from the difference-in-position values of 
the last suffix column (k* column) through the difference-in-position value of the first 
suffix column; and 

e) defining a set-of one or more patterns by collecting adjacent rows of the sorted 
k-tuple table whose suffix columns contain identical difference-in-position values, the 
relative positions of th e symbols in each pattern being determined by the primary column 
position indices, the set of one or more patterns being common to the k sequences. 
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